WHAT IS CLAIMED IS: 

1. A method of analyzing gene expression, gene annotation, and sample infomiation 
in a relational format supporting efficient exploration and analysis, the method comprising: 

providing a data warehouse which comprises a gene expression database for storing 
quantitative gene expression measurements for tissues and cell lines screened using various 
assays; a clinical database for storing information on bio-samples and donors; and a firagment 
index for biological properties for DNA firagments; 

receiving a query regarding gene expression of one or more DNA fragments; determining 
the level of gene expression of the one or more DNA fragments; correlating the level of 
gene expression with the clinical database and the fragment index; and 
displaying the results of said correlation. 

2. The method of claim 1, wherein the data warehouse is constructed in a star 
relational schema. 

3. The method of claim 1, wherein the data warehouse is constructed in a 
snowflake relational schema. 

4. The method of claim 1 , wherein the analysis of gene expression, gene annotation, 
and sample information further comprises identifying two sets of DNA 

fi-agments: those that are consistently expressed within the sample set, and those that are 
consistently not expressed. 

5 . The method of claim 1 , wherein the analysis of gene expression, gene 
annotation, and sample information further comprises a gene signature differential 
analysis which compares two gene expression signature and derives four sets of DNA gene 
fi:agments: those that are in both the first gene signature's present gene set and the second's 
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absent gene set, those that are in both the first gene signature's absent gene set 

and the second's present gene set, those that are in both present gene sets, those that are in 

both absent gene sets. 

6. The method of claim 1, wherein the analysis of gene expression, gene 
annotation, and sample information further comprises a fold change analysis which 
quantifies the change in expression for differentially expressed genes between pairs of 
DNA fragments. 

7. The method of claim 1, wherein the analysis of gene expression, gene 
annotation, and sample information further comprises an E Northem analysis which identifies 
DNA fragments with regard to a pair of user-selected percentiles over the values for a sample. 

8. A computer system comprising 

a data warehouse which comprises a gene expression database for storing quantitative 
gene expression measurements for tissues and cell lines screened using various assays; a chnical 
database for storing information on bio-samples and donors; and a fragment index for biological 
properties for DNA fragments and 

a user interface capable of receiving a query regarding gene expression of one or more 
DNA fragments and displaying the results of a correlation of the level of gene expression with 
the clinical database and the fragment index. 

9. The computer of claim 8, wherein the data warehouse is constructed in a star 
relational schema. 

10. The computer of claim 8, wherein the data warehouse is constructed in a 
snowflake relational schema. 

1 1 . The computer of claim 8, wherein the analysis of gene expression, gene 
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2 annotation, and sample information further comprises identifying two sets of DNA fragments: 

3 those that are consistently expressed within the sample set, and those that are consistently not 

4 expressed. 

1 12. The computer of claim 8, wherein the analysis of gene expression, gene 

2 annotation, and sample information further comprises a gene signature differential analysis 

3 which compares two gene expression signature and derives four sets of DNA gene fragments: 

4 those that are in both the first gene signature's present gene set and the second's absent gene set, 

5 those that are in both the first gene signature's absent gene set and the second's present gene set, 
M 6 those that are in both present gene sets, those that are in both absent gene sets. 

^3 1 13. The computer of claim 8, wherein the analysis of gene expression, gene 

pi 

^ 2 annotation, and sample information further comprises a fold change analysis which quantifies the 

Jg 3 change in expression for differentially expressed genes between pairs of DNA fragments. 

0 1 14. The computer of claim 8, wherein the analysis of gene expression, gene 

1 ; i 

P 2 annotation, and sample information further comprises an E Northern analysis which identifies 

w, 

Q 3 DNA fragments with regard to a pair of user-selected percentiles over the values for a sample. 

1 1 5. A computer program product comprising a computer-usable medium having 

2 computer-readable program code embodied thereon relating to a data warehouse which 

3 comprises a gene expression database for storing quantitative gene expression measurements for 

4 tissues and cell lines screened using various assays; a clinical database for storing information on 

5 bio-samples and donors; and a fragment index for biological properties for DNA fragments; 

6 the computer program product comprising computer-readable program code for effecting 

7 the following steps within a computing system: 

8 providing an interface for receiving a query regarding gene expression of one or more 
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DNA fl-agments; 

determining the level of gene expression of the one or more DNA fragments; 
correlating the level of gene expression with the cUnical database and the fragment S 
index; and 

displaying the results of said correlation. 

16. The computer program product of claim 1 5, wherein the data warehouse is 
constructed in a star relational schema. 

17. The computer program product of claim 15, wherein the data warehouse is 
constructed in a snowflake relational schema. 

1 8. The computer program product of claim 15, wherein the analysis of gene 
expression, gene annotation, and sample information further comprises identifying two sets of 
DNA fragments: those that are consistently expressed within the sample set, and those that are 
consistently not expressed. 

19. The method of claim 15, wherein the analysis of gene expression, gene 
annotation, and sample information further comprises a gene signature differential analysis 
which compares two gene expression signature and derives four sets of DNA gene fragments: 
those that are in both the &st gene signature's present gene set and the second's absent gene set, 
those that are in both the first gene signature's absent gene set and the second's present gene set, 
those that are in both present gene sets, those that are in both absent gene sets. 

20. The computer program product of claim 1 5, wherein the analysis of gene 
expression, gene annotation, and sample information further comprises a fold change analysis 
which quantifies the change in expression for differentially expressed genes between pairs of 
DNA fragments. 
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21. The method of claim 15, wherein the analysis of gene expression, gene 
annotation, and sample information further comprises an E Northern analysis which identifies 
DNA fragments with regard to a pair of user-selected percentiles over the values for a sample. 



